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Abstract 

When performing chemical data assimilation the observational, representativeness, 
and theoretical uncertainties have very different characteristics. In this study we 
have accurately characterized the representativeness uncertainty by studying the 
probability distribution function (PDF) of the observations. The average deviation 
has been used as a measure of the width of the PDF and of the variability (represen- 
tativeness uncertainty) for the grid cell. It turns out that for long-lived tracers such 
as N 2 0 and CH 4 the representativeness uncertainty is markedly different from the 
observational uncertainty and clearly delineates mixing barriers such as the polar 
vortex edge, the tropical pipe and the tropopause. 
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1 Introduction 


The key difference between conventional modelling and data assimilation is 
the use of observations and information on observational and other uncertain- 
ties. The uncertainties normally considered are the observational uncertainty, 
the representativeness uncertainty (i.e. the spatial variability over an anal- 
ysis grid cell), the background uncertainty, and the theoretical uncertainty. 
Accurately determining these uncertainties can be quite challenging. In this 
study we focus on the geophysical insights derived from examining the repre- 
sentativeness uncertainty of a chemical data assimilation system cast in flow 
tracking coordinates. 

All data assimilation systems require reasonable estimates of the observation 
error statistics for each observation system used in the assimilation. Without 
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Fig. 1. N 2 O Equivalent PV latitude - potential temperature cross sections of 
(a) representativeness uncertainty (v.m.r.), (b) observational uncertainty (v.m.r.), 
(c) obvservation (v.m.r.), and (d) analyses uncertainty (v.m.r.). The data used is 
from the Upper Atmosphere Research Satellite (UARS) Cryogenic Limb Array 
Etalon Spectrometer (CLAES) version 9 for January 1992. 


such estimates, no assimilation system can extract all the available information 
from the observations (Daley, 1993). 


In many data assimilation systems currently in use overly simplistic covari- 
ance models are used that cannot adequately describe state-dependent error 
components such as representativeness error (Dee et al., 1999). Cohn (1997) 
has highlighted the value of rigorous treatment of representativeness error and 
model error. The representativeness error is usually either ignored, treated as 
a constant, diagnosed by a time average of (Observations-Forecast) (O-F), or 
sometimes taken as the variance of observations over a region (Stajner et al., 
2001). Treatment of representativeness error is certainly an area which requires 
further attention in the future. Here we show that actually using the data to 
define the representativeness uncertainty rather than using assumptions can 
provide interesting insights. 
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Fig. 2. CH4 Equivalent PV latitude - potential temperature cross sections of 
(a) representativeness uncertainty (v.m.r.), (b) observational uncertainty (v.m.r.), 
(c) obvservation (v.m.r.), and (d) analyses uncertainty (v.m.r.). The data used is 
from the Upper Atmosphere Research Satellite (UARS) Halogen Occulutation Ex- 
periment (HALOE) version 19 for January 1992. 


2 Flow Tracking Coordinates 


Under adiabatic conditions air parcels move along isentropic surfaces (surfaces 
of constant potential temperature, 6 ). So when considering tracer fields 0 is 
a suitable vertical coordinate, since it acknowledges the likely vertical motion 
of air parcels. McIntyre and Palmer (1983, 1984), Hoskins et al. (1985), and 
Hoskins (1991) have shown the value of isentropic maps of Ertel’s potential 
vorticity (PV) in visualising large scale dynamical processes. PV plays a cen- 
tral role in large scale dynamics where it behaves as an approximate material 
tracer (Hoskins et al., 1985). 

As a result, PV can be used as the horizontal spatial coordinate instead of 
latitude and longitude (Norton, 1994; Lary et al., 1995). PV is sufficiently 
monotonic in latitude on an isentropic surface to act as a useful replacement 
coordinate for both latitude and longitude, reducing the tracer field from three 
dimensions to two. These ideas have already led to interesting studies corre- 
lating PV and chemical tracers such as N 2 0 and O 3 (Schoeberl et al., 1989; 
Proffitt et al., 1989, 1993; Lait et al., 1990; Douglass et al., 1990; Proffitt et al., 
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1989, 1993; Atkinson, 1993). A key result of these studies is that PV and ozone 
mixing ratios are correlated on isentropic surfaces in the lower stratosphere, 
as was first pointed out by Danielsen (1968). 

Since the absolute values of PV depend strongly upon height and the meteoro- 
logical condition, it is useful to normalise PV and use PV equivalent latitude 
(<f> e ) as the horizontal coordinate instead of PV itself. <f> e is calculated by con- 
sidering the area enclosed within a given PV contour on a given 6 surface. The 
<f) e assigned to every point on this PV contour is the latitude of a latitude circle 
which encloses the same area as that PV contour. Therefore, for every level in 
the atmosphere <f> e has the same range of values, -90° to 90°. This provides a 
vortex-tracking, and indeed a flow-tracking, stratospheric coordinate system. 

We have taken these now well established ideas and used them as a framework 
for our chemical data assimilation. This is certainly valid for our analysis 
interval of one day, and often for up to ten days or longer in the stratosphere. 
Because a major component of the variability of trace gases is due to the 
atmospheric motions it makes sense to use a co-ordinate system that ‘moves’ 
with the large scale flow pattern to perform our data assimilation. 


3 Representativeness Uncertainty 


In this study we seek to accurately characterize the representativeness un- 
certainty and improve the signal to noise by using all observations available 
within a grid cell and studying the probability distribution function (PDF) of 
these observations. The width of the PDF is used as a measure of the variabil- 
ity or representativeness uncertainty for the grid cell. When more than one 
observation is available in a grid cell we take the median of the PDF as the 
observation for that cell, and the median of the observation uncertainty PDF 
as the observation uncertainty for that grid cell. So both the observation and 
observation uncertainty used are directly from the data but selected in a way 
that we are sure they are truly representative of that grid cell. 

The criteria used to determine at what location we use an observation are 
equivalent PV latitude (<j> e ), and potential temperature (0). An observation 
is used in <p e -0 grid box where it lies. The grid used here has 21 potential 
temperature levels spaced equally in log(0) between 400 K and 2000 K, and 
32 equivalent PV latitudes spaced evenly between -90° and 90°. 

The uncertainty of the reconstructed observation has two components. First, 
for each 9) grid box we have a distribution of observed concentrations, and 
a distribution of observed concentration uncertainties. We take the observa- 
tional uncertainty to be the median observed concentration uncertainty for the 
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current distribution in the given 6) grid box. Second, the representative- 
ness is taken to be the average deviation of the concentration distribution for 
the given (<f> e ,9) grid box. The average deviation, or mean absolute deviation, 
is a robust estimator of the width of the distribution (Press et al., 1992). 

1 N 

<r tep = ADev(xi • • • Xn) = « S 1% “ x\ (!) 

iN j=i 

Let us now examine two examples of the representativeness uncertainty. 


3.1 N 2 0 


Figure 1 shows N 2 0 Equivalent PV latitude - potential temperature cross sec- 
tions of (a) representativeness uncertainty (v.m.r.), (b) observational uncer- 
tainty (v.m.r.), (c) observation (v.m.r.), and (d) analyses uncertainty (v.m.r.) 
for January 1992. The data used is from the Upper Atmosphere Research Satel- 
lite (UARS) Cryogenic Limb Array Etalon Spectrometer (CLAES) version 9 
for January 1992. 

We notice that the representativeness uncertainty in Figure 1 (a) has more 
structure in than the observational uncertainty in Figure 1 (b). The mixing 
barriers associated with the vortex edge (centered at around <f> e ~ 65°) and 
the tropical pipe (at <f> e « ±20°) are clearly visible. In each case this is close 
to sharp latitudinal gradients in N 2 0, and that this uncertainty is reflected in 
the analyses uncertainty. 


3.2 CH 4 


Figure 2 shows CH 4 Equivalent PV latitude - potential temperature cross sec- 
tions of (a) representativeness uncertainty (v.m.r.), (b) observational uncer- 
tainty (v.m.r.), (c) obvservation (v.m.r.), and (d) analyses uncertainty (v.m.r.) 
for January 1992. The data used is from the Upper Atmosphere Research 
Satellite (UARS) Halogen Occulutation Experiment (HALOE) version 19 for 
January 1992. 

We notice that the representativeness uncertainty in Figure 2 (a) has more and 
very different structure in than the observational uncertainty in Figure 2 (b). 
The representativeness uncertainty beautifully captures the mixing barriers 
associated with the vortex edge (centered at around <f> e ~ 65°) and the tropical 
pipe (at </> e « ±20°). In each case this is close to sharp latitudinal gradients in 
CH 4 , and that this uncertainty is reflected in the analyses uncertainty. Unlike 
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N 2 0, in the case of HALOE CH 4 the representativeness uncertainty also picks 
out the tropopause. 


3.3 Other Constituents 


Shorter lived, chemically reactive constituents have also been examined, and 
have a different behavior to relatively inert tracers such as CH 4 and N 2 0. 
Reactive species have large representativeness uncertainty in regions where we 
have considerable temperature, illumination, or other variations that impact 
the rates of reaction. For example, when using the Lagrangian flow-tracking 
coordinates described in the paper we have a large ozone representativeness 
uncertainty close to the summer pole. Such constituents were not considered 
here because of length constraints. 


3.4 Other Measures 


The ‘width’ or ‘variability’ of the PDF can be characterized in several ways. 
Two other common measures are the variance, or its square root, the standard 
deviation. Both of these measures have been tried and give essentially the same 
results. The reason for choosing the average deviation is that the variance 
and standard deviation depend on the second moment of the PDF. It is not 
uncommon to have a distribution whose second moment does not exist (i.e., 
is infinite). In this case, the variance or standard deviation is useless as a 
measure of the data’s width about a central value. This can occur even when 
the width of the peak looks, by eye, perfectly finite. The average deviation is 
a more robust estimator that does not suffer from this problem (Press et al. , 
1992). 


3. 5 Sampling 


In calculating the representativeness uncertainty we are assuming that the 
observations adequately sample the atmosphere. With the relatively low data 
volume from instruments such as HALOE, and the UARS yaw cycle, we con- 
sider a month of data at a time in flow-tracking co-ordinates to give a statisti- 
cally significant amount of data from which to calculate the average deviation. 
When flow-tracking coordinates are used in this way the representativeness 
uncertainty from both HALOE (a low data volume dataset) and CLAES (a 
high volume dataset) give similar results, suggesting that we have adequately 
sample the atmosphere. 
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The use of an entire months data will lead to a slight overestimate of the 
representativeness uncertainty due to the inevitable inclusion of some time 
variation. However, the choice of a Lagrangian coordinate system means that 
this is small, especially for constituents such as N 2 0 and CH 4 . In addition, the 
slight overestimate that results here in the representativeness estimate based 
on the data itself should be compared to no estimates or crude estimates of 
the representativeness uncertainty used to date, often not based on the data 
itself. 


4 Summary 


When performing chemical data assimilation the observational, representative- 
ness, and theoretical uncertainties have very different characteristics. In this 
study we have accurately characterized the representativeness uncertainty by 
studying the probability distribution function (PDF) of the observations. We 
have used average deviation as a measure of the width of the PDF and of 
the variability (representativeness uncertainty) for the grid cell. It turns out 
that for long-lived tracers such as N 2 0 and CH 4 the representativeness un- 
certainty is markedly different from the observational uncertainty and clearly 
delineates mixing barriers such as the polar vortex edge, the tropical pipe and 
the tropopause. 
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